Servant of Many Masters: Shifting priorities in Pareto-optimal sequential decision-making

نویسندگان

Andrew Critch

Stuart J. Russell

چکیده

It is often argued that an agent making decisions on behalf of two or more principals who have different utility functions should adopt a {\em Pareto-optimal} policy, i.e., a policy that cannot be improved upon for one agent without making sacrifices for another. A famous theorem of Harsanyi shows that, when the principals have a common prior on the outcome distributions of all policies, a Pareto-optimal policy for the agent is one that maximizes a fixed, weighted linear combination of the principals' utilities. In this paper, we show that Harsanyi's theorem does not hold for principals with different priors, and derive a more precise generalization which does hold, which constitutes our main result. In this more general case, the relative weight given to each principal's utility should evolve over time according to how well the agent's observations conform with that principal's prior. The result has implications for the design of contracts, treaties, joint ventures, and robots.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Toward negotiable reinforcement learning: shifting priorities in Pareto optimal sequential decision-making

Existing multi-objective reinforcement learning (MORL) algorithms do not account for objectives that arise from players with differing beliefs. Concretely, consider two players with different beliefs and utility functions who may cooperate to build a machine that takes actions on their behalf. A representation is needed for how much the machine’s policy will prioritize each player’s interests o...

متن کامل

Data Clustering of Solutions for Multiple Objective System Reliability Optimization Problems

This paper proposes a practical methodology for the solution of multi-objective system reliability optimization problems. The new method is based on the sequential combination of multi-objective evolutionary algorithms and data clustering on the prospective solutions to yield a smaller, more manageable sets of prospective solutions. Existing methods for multiple objective problems involve eithe...

متن کامل

Inputs and Outputs Estimation in Inverse DEA

The present study addresses the following question: if among a group of decision making units, the decision maker is required to increase inputs and outputs to a particular unit in which the DMU, with respect to other DMUs, maintains or improves its current efficiencylevel, how much should the inputs and outputs of the DMU increase? This question is considered as a problem of inverse data envel...

متن کامل

On the Optimal Solution Definition for Many-criteria Optimization Problems

When dealing with many-criteria decision making and many-objectives optimization problems the concepts of Pareto optimality and Pareto-dominance are inefficient in modelling and simulating human decision making. Different fuzzy-based definitions of optimality and dominated solution are introduced and tested on analytical test cases in order to show their validity and closeness to human decision...

متن کامل

Multi-objective Solution Approaches for Employee Shift Scheduling Problems in Service Sectors (RESEARCH NOTE)

Today, workforce scheduling programs are being implemented in many production and service centers. These sectors can provide better quality products and/or services to their customers, taking into account employees’ desires and preferences in order to increase sector productivity. In this study, an employee shift scheduling problem in the service sector is discussed. In the problem, the aim is ...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

CoRR

دوره abs/1711.00363 شماره

صفحات -

تاریخ انتشار 2017

Servant of Many Masters: Shifting priorities in Pareto-optimal sequential decision-making

نویسندگان

چکیده

منابع مشابه

Toward negotiable reinforcement learning: shifting priorities in Pareto optimal sequential decision-making

Data Clustering of Solutions for Multiple Objective System Reliability Optimization Problems

Inputs and Outputs Estimation in Inverse DEA

On the Optimal Solution Definition for Many-criteria Optimization Problems

Multi-objective Solution Approaches for Employee Shift Scheduling Problems in Service Sectors (RESEARCH NOTE)

عنوان ژورنال:

اشتراک گذاری